Hard Wall Stochastic Control based on Hallucination-EM and Power-EP

Author

  • Max Welling
Abstract

We study stochastic control problems in the presence of hard wall constraints. Walls are incorporated in the dynamics of the agent by restricting its domain and hence perturbing the noise process close to the walls. A novel penalty term is introduced for bouncing off a wall. To efficiently search for a good policy we propose the “hallucination expectation maximization” algorithm which iteratively maps the problem onto a non-Gaussian dynamical system. Hallucination weights anaesthetize the agent to render its local decisions optimal for the global planning problem. The E-step of HEM is solved using power-EP.

Similar resources

EP for Efficient Stochastic Control with Obstacles

We address the problem of continuous stochastic optimal control in the presence of hard obstacles. Due to the non-smooth character of the obstacles, the traditional approach using dynamic programming in combination with function approximation tends to fail. We consider a recently introduced special class of control problems for which the optimal control computation is reformulated in terms of a...

Scenario based technique applied to photovoltaic sources uncertainty

There is an increasing need to forecast power generated by photovoltaic sources in day-ahead power system operation. The electrical energy generated by these renewable sources is uncertain: it depends on solar irradiance, which cannot be controlled and varies with climate conditions. Stochastic programming based on various scenarios is an efficient way to deal with such uncertaintie...

The Relationship between Metacognitive Beliefs and Positive and Negative Symptoms in Patients with Schizophrenia

The aim of the present research was to determine the relationship between metacognitive beliefs and the positive and negative schizophrenic symptoms of patients with hallucination and delusion. The sample consisted of 127 patients with schizophrenia under therapy who were referred to the Psychiatric Department of Emmam-Hossin hospital as outpatients or inpatients in the first quarter of the year 2005. Part...

Operation Planning of Wind Farms with Pumped Storage Plants Based on Interval Type-2 Fuzzy Modeling of Uncertainties

The operation planning problem encounters several uncertainties in terms of the power system’s parameters such as load, operating reserve and wind power generation. The modeling of those uncertainties is an important issue in power system operation. The system operators can implement different approaches to manage these uncertainties such as stochastic and fuzzy methods. In this paper, new ...

Application of Stochastic Programming to Determine Operating Reserves with Considering Wind and Load Uncertainties

Wind power generation is variable and uncertain. In power systems with high penetration of wind power, determining the equivalent operating reserve is the main concern of the system operator. In this paper, a model is proposed to determine operating reserves in simultaneous market clearing of energy and reserve by stochastic programming based on scenarios generated via Monte Carlo simulation ...


Publication date: 2008